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Abstract. In majority bootstrap percolation on a graph G, an 
infection spreads according to the following deterministic rule: if at 
least half of the neighbours of a vertex v are already infected, then 
V is also infected, and infected vertices remain infected forever. 
Percolation occurs if eventually every vertex is infected. 

The elements of the set of initially infected vertices, A C V{G), 
are normally chosen independently at random, each with proba- 
bility p, say. This process has been extensively studied on the 
sequence of torus graphs [n]'^, for n = 1, 2, . . ., where d = d{n) is 
either fixed or a very slowly growing function of n. For example, 
Cerf and Manzo [T^ showed that the critical probability is o(l) if 
d{n) ^ log, n, i.e., \ip = pin) is bounded away from zero then the 
probability of percolation on [nY tends to one as n ^ oo. 

In this paper we study the case when the growth of d to oo is not 
excessively slow; in particular, we show that the critical probability 
is 1/2 + 0(1) if d ^ (loglogn)^ logloglogn, and give much stronger 
bounds in the case that G is the hypercube, [2]''. 



1. Introduction 

Consider a finite graph G with two parameters, and r„, attached 
to each vertex t>, with q^+r^ greater than the degree of v. Suppose each 
of the vertices may take either one of two states, 'active' and 'inactive', 
say. At each instant, some of the vertices may 'wake up' at random; 
whenever a vertex v does so, if at least q^ of its neighbours are active 
then it becomes active, if at least r„ of its neighbours are inactive then 
it becomes inactive, and if neither of these cases holds then v keeps 
its state. Given an initial distribution of active sites, one can then ask 
what happens to the system in the long run. 

For example, we may take G to be a (i-regular graph with d odd, 
and parameters q^ = = {d + l)/2 for every vertex v. in this case. 
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whenever a vertex wakes up, its state becomes that of the majority 
of its neighbours. For a ci-regular graph with d even, we may take 
Qv = "^v = d/'^ + ^ for every vertex v. 

A special (and very well-studied) example of this process is the zero- 
temperature Ising model, where the process occurs on the lattice U^. 
Despite all this interest, however, only rather weak (though far from 
easy) results have so far been proven (see [17] , [23] and [23] , for exam- 
ple) . Another example is the celebrated model of the brain, introduced 
over 60 years ago by McCuUoch and Pitts [22] , and shown by them to be 
complex enough to include a universal Turing machine as a particular 
case. 

In this paper we shall study a simpler model, introduced by Chalupa, 
Leith and Reich [T5| in 1979, in which the vertices may only change 
state in one direction, from inactive to active, say. In order to make 
this easier to remember, let us refer instead to 'healthy' and 'infected' 
vertices, so that a healthy vertex may be infected, but infected vertices 
never recover. 

More generally (and more precisely), let G be a finite graph, let 
r G N, and let A C V{G) be a set of initially infected vertices. The 
set of infected vertices is updated as follows: if a healthy vertex has at 
least r infected neighbours, he becomes infected; otherwise he remains 
healthy. In other words, we have a sequence of sets 

A = C a(^) C . . . C A^""^ C ... 

where A^"'+^^ = A^™) U {v e V{G) : \T{v) H A^™)] ^ r}. If the entire 
graph is eventually infected, i.e., A^"^^ = V{G) for some m G N, then 
the set A is said to percolate on G. This process is known as the r- 
neighhour bootstrap percolation on G; if G is rf-regular with d odd and 
r = \d/2'], then this is the majority bootstrap percolation. We remark 
that this process is very different from the random majority bootstrap 
process studied in [4j, and also the biased majority process studied 
in [26j, since in each of those processes vertices may change states in 
both directions, i.e., may be infected and then later healed. 

The bootstrap process has been well-studied in the case that G is 
the torus [nY, where r and d are both fixed (with r ^ d), the elements 
of the set A are chosen independently at random, and n — >■ cxd. (In 
fact the results below were proved for the grid (i.e., the subgraph of 
Z'^ induced by the vertices of [n]"') but the proofs also apply to the 
corresponding tori. Note that for n = 2 the grid is (i-regular, but for 
n ^ 3 the degrees vary from d to 2d.) 

Let p = p{n) = F{v G A) for each v G V{G), and write Pp for the 
corresponding (product) probability measure. Clearly the probability 
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that A percolates is monotone in p, since extra initial infections can 
only make percolation more likely. Hence there exists a (unique) value 
Pc G (0, 1), depending on n, d and r, for which P(A percolates) = 1/2. 
We call this value the critical probability, and in general define 



The challenge is to determine the value of pc, and also the size of 
the critical window, i.e., the range pi-e — Pe, where pa satisfies Pp^(v4 
percolates) = a for each a G (0, 1), and e ^ 0. In general we would 
like to show that the window is small, and hence that the threshold for 
percolation is 'sharp'. 

The first rigorous results for bootstrap percolation on finite graphs 
were obtained by Aizenman and Lebowitz |3], who showed that 

for some functions < c{d) < C{d), and moreover that the size of 
the critical window for [n]'^ is 0{pc) when d is fixed and r = 2. The 
problem when 3 ^ r ^ d seems to be more difficult (when r > d we 
have Pc = I — o(l), so the problem is less interesting). Despite this, 
Cerf and Manzo [H] , building on work of Cerf and Cirillo [13| (as well 
as the older work of Schonmann [25] on the lattice Z*^), were able to 
prove the corresponding result for all fixed d and r. They proved that, 
if r ^ d, then 

c{d,r) ,1 ^d ^ ^ C{d,r) 



^ Pc{[nY,r) ^ 



{log,_^nY-'+^ ^ ^'^^ J ' / - (log,_i n)'^--+i ' 

for some functions < c{d,r) < C{d,r), where \og^_in is the (r — 1)- 
times iterated logarithm, i.e., log^ n = logn and log;,.,.^ n = log(log;, n) 
for each k eN. In particular, their proof implies that Pc{[nY, d) = o(l) 
if 1 <C (i ^ log^ n, where log^ n is the number k such that log^ n ^ 1 > 
^ogi._^_i n. In particular, note that log^ n -C log^ n for every k E N. 

Finally, we remark that in the case d = r = 2, Holroyd ^20j was able 
to prove a much sharper result: that in fact 

Pc{[n]^2) = — + o( — 

18 log 72 \logn^ 

(Here, and throughout, log is taken to the base e). The reader who 
is interested in bootstrap percolation on other types of graphs should 
also see the work of Balogh and Pittel p] on random d-regular graphs, 
and of Balogh, Peres and Pete on infinite trees. An application of 
the techniques of bootstrap percolation to the zero-temperature Ising 
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model may be found in [Uj, and for a brief survey of the physical 
applications of the bootstrap process, see [T]. 

In this paper we shall be interested in majority bootstrap percolation 
on very high-dimensional lattices, and more generally on arbitrary d- 
regular 'lattice-like' graphs. The fundamental result of Cerf and Manzo, 
stated above, has two drawbacks. The first is that the known bounds 
on c{d, r) and C{d, r) are rather far apart for large d and r, and the sec- 
ond that the theorem is only useful when n is extremely large (so that 
log^_^n > c{d,r)). We shall show that the latter problem is unavoid- 
able; indeed when n is not quite so large (at most 2^^^°^ , say) then the 
behaviour of Pc is quite different (for sufficiently large d) . We shall also 
study in more detail majority bootstrap percolation on one particular 
graph, the hypercube Qd = [2]*^, which may be thought of as an extreme 
point of the family {[n]"^}, where n = n{t) and d = d{t) are arbitrary 
functions. The hypercube is a very well-studied combinatorial object; 
for example, see the work (relating to a different percolation problem 
on the hypercube) of [16], [2] and [11], and the recent improvements 
of [12], [18] and [19]. In a subsequent paper [6] we shall investigate 
2-neighbour bootstrap percolation on high- dimensional lattices, and 
show that the problem essentially reduces to the equivalent question 
for the hypercube. As the reader will discover, a similar phenomenon 
also occurs for majority percolation. 

The structure of the remainder of the paper is as follows. In the next 
section we shall state our main results; in Section [3] we shall describe 
some of the simple tools that we shall use later; in Sections H] and [5] 
we shall prove fairly strong bounds on pc{Qn,n/2)] in Section [6] we 
shall study more general d-regular graphs; and in Section [7] we shall 
describe some open problems and conjectures. The paper ends with an 
appendix in which the tools from Section [3] are proved. 

2. Main Results 

In this section we shall state our main results. We begin with a 
theorem which gives bounds on the critical probability for majority 
bootstrap percolation on the hypercube. Recall that the n-dimensional 
hypercube, Qn = [2]", is the n-regular graph with vertex set V{Qn) = 
{0, 1}", and edge set E{Qn), where 

xy e E{Qn) if and only yi]\ = 1. 

Observe that the vertices of Qn may also be thought of as subsets of 
[n], in which case xy G E{Qn) if and only if \x/\y\ = 1. 
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We shall assume throughout that n is even, so that we may write 
n/2 instead of \n/2]. However, all our proofs are also valid for n odd. 

Theorem 1. Let n eN, X eM, 



1 1 /logn Aloglogn 

2 2y n ^ynlogn 

and let the elements of A C. V{Qn) be chosen independently at random, 
each with probability p. Then in majority bootstrap percolation, 



F{A percolates on Qn) 
as n oo. In particular. 



if A ^ -2 

1 if A > 1/2 



. .1 1 /logn 21oglogn 

Pc[Qn,n 2) > - - oV / , 

^ 2 2 \ n vnlogn 

for sufficiently large n, and 



, .1 1 /logn log logn / log logn 



2 2 V n 2^n logn \2Vn logn 
as n oo. 

Note that Theorem [T] determines the first two terms in the expansion 
of Pc {Qn, '"■/2) , but not even the order of the third, since it does not tell 
us whether or not A is likely to percolate when A = 0. In Section [5l we 
shall discuss how these bounds might be further improved, and explain 
the limitations of our method. 

We next turn to more general d-regular graphs on N vertices. As it 
turns out, the method of the proof of Theorem [T] can be adapted, in 
a slightly simpler form, to deal with a wide range of such graphs. As 
usual, we write T{u) for the set of neighbours of a vertex u G V{G). 
Furthermore, for each u G V{G) and /c G N let us define 

S{u,k) = {veV{G) : d{u,v) = k}, 

so that S{u, 1) = T{u), and also let B{u,k) = {v : d{u,v) ^ k}. The 
property of the hypercube which we shall need in order to prove the 
theorem below, is that the set S{u,k) fl T{v) is not 'too big' for any 
u G V{G), V G V{G) \ B{u, k — 1), and 'sufficiently large' k (the exact 
size needed depends on A^). 

The following theorem is somewhat technical, so on a first reading 
the reader may wish to assume that k and the functions fi are all 
constant. 
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Theorem 2. For each d e N, let N = N{d),k = k{d) e N, and 
let G = G{d) be a d-regular graph on N vertices. Furthermore, let 
cj, /, /i, . . . , //s : N — s> N be arbitrary functions satisfying 

1 ^ f^{d) ^ fid) = O ^ 



^ k log d 

and uj [d) oo as d (yo. Suppose that, for every G N and i & [k 

\S{x,z)nT{y)\ ^ f,{d) 
for every x G V{G) and y G V{G) \ B{x, i — 1), and 

d^ 



N ^ exp 



{U{d)k)\fu-M + f,{d)) Yltl f^{d) 



for every d E N. Then 



p.{G,d/2) = i + o(l) 



as d oo. 



We remark that the bound on N in Theorem [2] cannot be improved 
substantiaUy. An example showing this will be described in Section [61 
Other extensions are possible though; in particular, the bound on 
\S{x,i) n T{y)\ does not have to hold for every x G V{G): rather, a 
(small) exceptional set is permissible, since with high probability these 
vertices will be infected in the first round anyway. We shall not need 
this generalization however, and do not wish to further complicate the 
statement of the theorem unnecessarily. 

Our main motivation for proving Theorem [2] is the following, almost 
immediate corollary. Let [n]'^ denote the (i- dimensional n x . . . x n 
torus, and note that [nY is 2(i-regular, and has n'^ vertices. Moreover, 
[nY satisfies the conditions of Theorem [2] with fi{d) = i + l for each 
2 G N (see the 'Proof of Corollary [21 in Section [S]). Thus, we may apply 

the theorem as long as /c^ = o ( ), and n'^ ^ 2^''; doing so gives 

\\ogdJ 

the following result. 

Corollary 3. Let n = n{t) and d = d{t) be functions satisfying 

3 ^ n = 2^ 

or equivalently, d ^ £:(loglogra)^ logloglogn for some e > 0. Then 

1 



Pc{[nr,d) = - + o(l) 



as t ^ oo. 
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The proofs of Theorems [Hand [2] are not short, but the main ideas may 
be summarised in a few sentences. For the upper bound in Theorem [H 
we study the first two steps of the process in detail, and show that with 
high probabihty, at least (about) 3/4 of the vertices will be infected by 
this stage; the rest of the proof is then simply a matter of battling the 
weak dependence between the events {x G ^'•^-'}sey(Qn)- 

The upper bound is somewhat harder, since the process may continue 
running for many steps. We overcome this by introducing a new, 'more 
generous' process, which nonetheless stops quickly; a simple coupling 
then shows that the original process must also stop eventually, before 
it has infected the entire vertex set. The proof of Theorem [2] is similar, 
though in this case the process must be allowed to run for many more 
steps, since we are dealing with much larger vertex sets. However, since 
we only wish to prove a much weaker result (that Pc{G, d) = l/2+o(l)), 
the details of the calculations become much simpler. 

3. Tools 

Since all the results in this section will either be well known, or 
simple approximations of binomials, we shall postpone the proofs until 
the appendix. We begin by recalling the standard Chernoff bound (see 
[To] for example). 

Lemma 4. Let nEN,0<p<l,t^O and S{n) ~ Bin(n, p). Then 

(a) F{S{n) ^np + t) < exp 

and similarly, 

exp 

The following lemma gives an almost matching lower bound in the 
case in which we shall be interested. 

Lemma 5. Let C ^ be a constant, and n & N be sufficiently large. 
Let p ~ 2 ~ '"^^^^^ ^ ^ S^'^n ^ 1, and let S{n) ~ B{n,p). Then 

We next state a simple generalization of Lemma [U which will be a 
key tool in the proofs of the lower bounds in Theorems [1] and [2l 
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Lemma 6. Let t,k,di, . . . ,dk ^ N and p G (0, 1). Let Xi ~ Bm{di,p) 
for each i G [k], let Yk = J2i=i ^^'h (^'i^d let D{k) = J2i=i ^^"^i- Then 

F{Yk^E{Yk)+t) ^ {2t)''~'exp 

Now an easy approximation, will we shall use to prove the upper 
bounds in Theorems [1] and [2l 

Lemma 7. Let p G (0, 1) and n E N satisfy pv? ^ 1, and let S{n) ~ 
Bin(?T,,p). Then 

W{S{n) ^ m) ^ 2p™/2 

for every m G [n]. In particular, if c,e > and p ^ e^™, then, for 
some b = b{c, e) > not depending on n, 

F{Sin) ^ en) ^ e"^"'. 

In order to apply Lemma [71 we shall make frequent use of the fol- 
lowing simple lemma. 

Lemma 8. Let G be a graph, let k,m E N, and suppose that for each 
X G V{G), 

\B{x,k)\ = \{y eV{G) : d{x,y) ^k}\ ^ m. 

Then there exists a partition 

V{G) = BiU...UBm 

of V{G), such that if y,z G Bi, then d{y, z) ^ k + 1. 

Lemma [8] immediately implies the following result for hypercubes. 
We remark that below, and throughout the paper, we shall often write 
X G Q„ to mean x G V{Qn)- 

Lemma 9. Let n. A; G N, and x G Qn- Then there exists a partition 

S{x, k) = BiU ...UBm 

of S{x,k) into m ^ ^(^"1) ^ ^n'^'^ sets, such that if x,y G Bj for 
some j , then d{x,y) ^ 2k. 

We remark that Lemma [9] is a simple special case of the following 
old result, due to Baranyai p]. 

Lemma (Baranyai, 1973). Let Kn'''^ denote the complete h-uniform 
hypergraph on n vertices. If h divides n, then K^"* can be partitioned 
into 1-regular hypergraphs. 

We shall also need the following two, rather easy lemmas, and one 
trivial observation, which follows by the convexity of (2) . 
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Lemma 10. Let 6 = 6{n) ^ as n ^ oo, and let p = 6. 

Let S{n) ~ B{n,p) and let S'{n) ~ 1 + B{n — l,p). Then, for any 
^ m = m{n) ^ n/2, 

¥{S'{n)^m) = {l + o{l))¥{S{n)^m) 

as n oo. 

Lemma 11. Let 6 = 6{n) ^ as n oo, and let p = 6. Let 

S{n) = X + Y{n), where X ~ Bin(l,p) and Y{n) ~ Bin(n — 
Then, for any ^ m = m{n) ^ n/2, 

P(X(1) = 1 I ^ m) = (1 + o(l))P(X(l) = 1) 

as n oo. 

Observation 12. Let k,ai, . . . , ak, A eN, and suppose max{ai} ^ A. 

i 

Then 



E 



^ A 



Finally, we shall use the following, probably well-known lemma (see 
Lemma B.7), and we recall the standard second moment method 
(see [in], for example). 

Lemma 13. Let p G (0, 1), let n eN, and let S{n) ~ Bin(n,p). Then 

F{S{n) ^ [np\ - 1) ^1/2 ^ F{S{n) ^ \np]) . 
In particular, if e > and p = p{n) G [£, 1 — e] for every n E N, then 

F{S{n) ^ up) = l/2 + o(l) 

as n oo. 

Lemma 14. For any random variable X , and any a > 0, we have 

Var(X) 



f(^\X-E{X)\ ^ a|E(X)|) ^ 



a2E(X)2' 



In particular, if X^ is a sequence of non-negative random variables, 
such that Var(X„) = o(E(X„)^), then 

p/EpQ ^ ^ 2E(X„)^ = l-o(l). 



10 



JOZSEF BALOGH, BELA BOLLOBAS, AND ROBERT MORRIS 



4. Proof of the upper bound in Theorem \T\ 
Let n G N be sufficiently large, and let 



1 1 logn A log log n 
= 2 - 2V^ + 7^1^' 

for some A G R. We shall show that if A > 1/2, then for some constant 
c > and each vertex x G V, P(x ^ A^^^^) < e~^^ , and thus that 
P(A(ii) = V) = l-o(l). 

The proof comes in three stages: ffist we shall show that P(x G 
> 3/4 (Lemma [IS]); next we shall show that P(a; ^ A^^^) < e'""- 
for some constant c (Lemma [T6|) : and finally, we shall show that P(x ^ 
^(11)) < e-cn2 (Lemma [IZD. 

In all that follows, we shall assume that the elements of A^^^ are 
chosen independently, each with probability p, and that A*^*) are the 
infected vertices after i rounds. 

In fact, we need a slightly more general concept. Let A^^ denote 
the set of infected vertices after round i if the infection threshold is r. 
Thus A(^) = A^s- 

Lemma 15. Let x G V{Qn), and p and A^'^^ be defined as above. Let 
r ^n/2 + 100. If X > 1/2, then F{x G A^) ^ 3/4 + o(l) as n ^ oo. 

Proof. Let x G V{Qn), where n is large. With probability p = 1/2 + 
o(l), we have x G A^^\ For those x ^ A^'^\ we shall use Lemma E] to 
show that P(|r(x) n Ai^^\ ^ r) ^ 1/2 + o(l). 

So let us assume that x ^ A^^\ and let R = T(x) fl A^'^'^ and S = 
r(x)nA^.^^\AW. We want to show that P(|i?| + |5| ^ r) ^ l/2 + o(l). 
This follows easily from the following claim. 

Claim : Let 100 ^ m ^ n/2. Then 

p(i5i<^"°^;r'"'|w»^--.)-o(i). 

Proof of claim. We use the standard second moment method ( Lemma [T4 
Throughout the proof of the claim we assume that \R\ ^ r — m; in par- 
ticular, we shall sometimes write P(-) for P(- 1 ^ r — m). First we 
must bound the expected size of 5". By Lemma [5] we have, for each 
vertex y G T{x), 



P(|r(?/)nA(°)| ^r) ^ exp( -26^71-46 
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, . 1 1 /logn A log log n , . . 

where o = - — P = -\ , , (iNote that we must use 

2 2 V n y/n log n 

Lemma [5] with C = 101 here, since we assume x ^ AW.) Observe that 
26^n + 46,r^ = - 2Aloglogn + 2 + 0^ 



logn 2 \ logra 

and note that, by Lemma [TT| P(|/ ^ ^ r — m) ^ 1/3 if n is 

sufficiently large. Thus 

E{\S\\\R\^r -m) = n¥ {y ^ A^^'^ \\R\ ^ r - m)F{\T{y) n A^^'^\ ^ r) 



^ ^ / ox2 / n loglogn 

^ 7— r exp —2d n — 4:6 



3e^ \ Y logn 

V^(logn)(^^-^)/^ 

3^9 



since n is sufficiently large, so we can assume the term O ^ ''^"^og n"^ 
is at most 1. 

Now, we need to show that the variance is not too big. Consider 
two distinct vertices y,z & r(a^), and note that \T{y) fl T{z)\ = 2, 
and that F{y e S \ z e S) ^ Fiy e S). Let r{y) n r(^) = {w,x}. 
Let p' = F{y e A^o) | \R\ ^ r — m), and note that, by Lemma 
p' = (1 + o(l))p. Then 

P(y G 5 I 2 G 5, ^ r - m) ^ F{yeS\we A^^\ \R\>r- m) 

= P(y ^ I -m)¥{\{T{y)nA^^^)\{x,w}\ ^ r - l) 

= (1 -p') F{S'{n) ^ r), 

where >S"(n) ~ 1 + Bin(ri — 2,p). But 

¥{y e S\\R\^r-m) = {l - p')F{S{n) ^ r) , 

where S{n) ~ Bin(n — l,p). Thus, by Lemma [TUl 

¥{yeS\zeS) = (1 +o(l))P(y G 5). 

Finally note that E(|S|) ^ 00, and so Var(|5|) = o{E{\S\f). 

By LemmalH it follows that for any e > 0, \S\ ^ (1 -£)E(|5|) with 
high probability. In particular, 

p(i5i< ^''°^;r"-'" )<p(i5i<gi^)^o(i). 

as claimed. □ 
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/^nQg^V4A-l)/2 ^ 

Now, let m = , let R denote the event that l-Rl ^ 

r — m, and let S denote the event that 15*1 ^ m. Since A > 1/2, we have 
m ^ y/n logn, and so r — m < np = K{\R\) . Therefore, by Lemma [T3| 

P(^) ^ l/2 + o(l) 

as n oo. Moreover, by the claim we have 

F{S \ {x ^ A^^^) A R) = l-o(l). 

Thus 

P(x G 4^)) ^ P(x G + A ^ I X ^ P(x ^ 

= p + {l-p)F{R)F{S\{x A^^^) AR) 

as required. □ 
Remark 1 . Note that we only needed A > 1 /2 in order to show that 



m 



^ ^ ^^/^ ^ y/n logn ^ f--pjn + 100. 



Thus, our proof will actually give 



Pc{Qn,n/2) ^ - - -\ + ^ , , + O 



2 2\n l^n logn \ i/ra log n 

On first sight, it would seem that we are basically done, since in the 
third round essentially all of the remaining healthy vertices should be 
infected. Unfortunately, and crucially, however, we have lost indepen- 
dence. The next step, in which we go from most of the vertices to 
almost all of them, in fact turns out to be the most problematic. 

Recall that the elements of A are always chosen independently at 
random with probability p. 

Lemma 16. For each 5 > there exists a constant c = c{6) > such 

I log n 1 

that the following holds. Let n eN, and let p E I W , - ] be 



large enough so that, 




Then P(x G A^^^) ^ 1 - e" 
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Proof. We wish to estimate P(a; ^ A^^''); in order to do so, we must 
show that the probabihty that |r(a;) \ A^^'^] ^ n/2 is small. We begin 
with an important fact: 

If S C Qn satisfies d{y, z) ^ 5 for every y, z E S with y z, 
then, for any r, the events {y E Ai'^^}y(zs are independent. 

Let Y = S{x,3) = {y E Qn '■ d{x,y) = 3}, so \Y\ = (g), and let 
m = 3(2) ^ 2n^. By Lemma M there exist disjoint sets Bi, . . . , Bm 
such that [j Bj = Y, and for each j E [m], the events {y E A^'^^}y^Bj 
are independent. 

Now, let e > 0, and with foresight, observe that by Lemma HI 



p (|r(x) n 



n 



< - - 
2 



^ exp (— 5^£^n) 



(1) 



since 



p ] n <^ 6 e n. Let J{x) denote the event that this does 



not happen, i.e., that |r(a;) fl A^^'^] ^ n/2 — S'^e'^n, and assume from 
now on that J{x) holds. Now choose S C T{x) \ A^°^ with \S\ = n/2. 
We thus have at most ("^^^/2^ choices for S. We shall show that it 

is extremely unlikely that S D A^^^ = ^. 

Indeed, let T = r(5') \ {x}, so d{x, y) = 2 for every y E T, and note 
that T = Ti UT2, where Ti = {y E T : \r{y) nS\=i} for i = 1,2. Note 
further that \Ti\ = n^A and jTa] = ("f ). 

Let a = |Tinv4(3)| and 6 = iTaHA^^)], and suppose that SnA^'^^ = 0. 
Then a + 26 ^ n'^/4. (This follows by simply counting edges - each 
vertex of S sends at least n/2 edges to vertices not in A^^\) But, by 



Lemma HI since p E \ - 



logn 1 



n 



- , we have 
2 



P 



|TinA(°)| - ^ 



^ 2^/nFlogn\ ^ exp (—n logn) 



(2) 



and similarly 



P 



|T2nA(o)| - - 

16 



^ 2^/nF\ognj ^ exp (—n logn) . (3) 

Let F{S) denote the event that neither of these events occurs, i.e., that 
llTi n A^l - n^/8i\ ^ 2y^n^ lognfori = 1,2, so P(F(S)) ^ l-2n-'', 
and assume that the event F{S) holds for every S. 



So, with very high probability, if |T fl A^^^ \ | > logn then 

a + 2b > and thus S" fl A^^^ 7^ 0. Hence the following claim will 

(essentially) complete the proof. 
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Claim: 



P 



Proof of claim. Consider the bipartite graph i7, with vertex set 1^ UF, 

where W = T\A^^^ and Y = S{x, 3), and edge set {wy : wy E E{Qn)}. 
Furthermore, let us colour the edges of H red and blue, according to 
whether or not the cndpoint in Y is also in A^'^\ i.e., c(wy) = red if 
and only if y e ^4^^). 

3^2 

Now, e{H) = (n — 2)|1^|, and \W\ = h 0{\/n^\ogn), since we 

16 

assumed that ^(5") holds. Suppose \T n A^^^ \ ^(°)| = O (^^nMogn). 

Then only O (^■\/n^ log vertices of W have at least n/2 neighbours 
in A^'^\ and thus the number of red edges in H, eR{H), satisfies 

eR{H) ^ + Oi^nHogn) = + 0{\/nHogn), 

and thus 

eB{H) ^ - O(^A^l^) = ^ - O(aA^M^) 

also. Now, recall the partition Si, ... , of Y into independent sets, 
and define a refinement of it by setting 

Bjd) = {zeBj : |r(z)nr| = 

for each i e [3] and each j e [m]. Note that the sets Bj{i) are deter- 
mined by the set 5" and the partition Si, ... , B^- 

Subclaim : If £ > (chosen above) is small enough, then there exists 
an i e [3] and a j e [m] such that 

(a) i ^ Sen edges of H are incident with Bj{i), and 

(b) at most (1/2 + 6/2)i of these edges are red. 

Proof of subclaim. Suppose the subclaim is false. Let us count the total 
number of blue edges. From those Bj{i) with at most Sen incident 
edges, we get at most 9emn ^ ISen''' edges. From the others, we get 
at most e{H){l/2 - 6/2) ^ n\W\{l/2 - 6/2) blue edges. If e is small 
enough, this contradicts the bound on esiH) above. □ 

Now, recall that the events {y E A^'^^}y^Bj{i) are independent, and 
that we chose p so that F{y E ^ l/2+<^ for each y E Qn- Unfor- 

tunately, this event is not independent of which members of B{x, 2) are 



MAJORITY BOOTSTRAP ON THE HYPERCUBE 



15 



in A^^^; however, for each vertex y G Bj{i), y has at most three neigh- 
bours in B{x, 2). Considering the {n — 3)-dimensional sub-hypercube 
containing y but none of these neighbours, we see that, for any set 
B{x, 2) n we have F{y e A^^j^) ^1/2 + S. 

Let Ej{i) denote the event that the set Bj{i) satisfies the conditions 
(a) and (6) of the subclaim, and note again that each vertex in Bj{i) 
sends z ^ 3 red edges into T. Thus, by Lemma HI 

Sen 1 ^\ /I 6\ Sen 
^ - + - 



^ 6n^ exp 



2 2 



6'^en 



However, we showed that if \TnA^^'' \A^^'' \ = O (^^/n^ log raj , then one 
of the events Ej{i) occurs. Hence this proves the claim. □ 



Let M{S) denote the event that |T n A^^) \ A^^^\ > 6^/nMogn, so 

by the claim, P(M(S')) ^ 1 — Gn^exp ( — j . Finally, recall that, 

assuming the event J{x) holds, we had at most 

n/2 + 6^e'^n\ / e \^'^'" f 5'^en\ 

choices for the set if e is sufficiently small, since {-^lY < e^^^^ if x is 
sufficiently small. 

Now, we claim that if each of the events J(x), F{S) and M{S) holds 
(for all 5" as above), then x G A^^\ Indeed, ^(5") A M{S) implies that 
a + 2b > n2/4, and therefore that S fl A^^^ ^ 0, as explained above. If 
this holds for every S*, then it follows that x G A^^^ . Therefore, by ([1]), 

(ED, (ED, (SD and (ISD, 



5 S 

^ exp (-5Vn) + exp {^-^-^ (^Gn^exp (^-^^^ + 2n~" 
^ exp ( — cn) 

for some c = c(5) > 0, as required. □ 

Finally, we make the (easier) jump from exponential to super-exponential 
probability; this step takes us from round /c, to round 2k W. 
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Lemma 17. Let k E N, and let n he sufficiently large (in particular, 
n ^ Ak). Let x E Qn and c > 0. Suppose that p is chosen such that 

for each y E Qn- Then there exists a constant d = d{c, k) > such that 

Proof. As in Lemma [161 note first that 

if S C Qn satisfies d{y, z) ^ 2k + 1 for every y,z E S with y z, 
then the events {y E A^'''^}yi^s are independent. 

Let X E Qn, and let m = {k + 1)(^) ^ '2,n''. By Lemma [HI there exist 
disjoint sets -Bi, . . . , Bm such that IJ-^j = A; + 1), and for each 
j E [m], the events {y E A^'''>}y(zBj are independent. 

The argument is now very simple. Suppose x ^ ^4*^^^+^^; we claim 
that for each t E [0, A; + 1] there exist a set T{t) C S{x, t) such that 
T(t) n = 0, and 

Indeed, let T(0) = {x}, and note that since x i ^(2fc+i),T(0) satisfies 
the conditions. Now, suppose we have found Tit) as required. Then 
each y E T{t) has at most n/2 neighbours in S{x,t + 1) fl y4'^^^~*\ and 
so at least n/2 — t neighbours in S{x,t + 1) \ A'^^'^"*). Moreover, each 
element of S{x,t + 1) has exactly t + 1 neighbours in S{x,t). Thus, 
by counting edges, there must exist a set T(t + 1) C S{x,t + 1) with 
T{t + 1) n = 0^ and \T{t + 1)| ^ (n/2 - t)\T{t)\/{t + 1), as 

required. The second inequality holds since n ^ 4k. 

Consider T{k+1), and note that it has at least an^~^^ elements, where 
a > does not depend on n. Thus, there must exist a j G [m] and 
an absolute constant e > (not depending on n) such that \Bj\ ^ en, 
and \T{k + l)nBj\ ^ £\Bj\. Indeed, if no such j exists, then we would 
have |T(fc + l)| ^ emn + e < Oin^~^^ if s is sufficiently small (recall 
that m ^ 2n^). 

Now, recall that ¥(y ^ A^'^^) < e"'^" for each y E Bj, that the events 
{y E A^^^y(.B, are independent, and that T{k + 1) n A^^^ = 0. Thus, 
by Lemma [7] we have 



P {\T{k + 1) n B^\ ^ 6\Bj\ I \Bj\ ^ en) ^ e"^!^^!' ^ e"''"''^' 
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for some b = b{c, e) > 0, and so 

P(x ^ ^(2'=+^)) ^ P(3j with \Bj\ ^ en and \T{k + 1) n Bj\ ^ e\B^\) 

^ me"''"'"' ^ e"'^"' 

for some d = d{c,e) > 0. Since e = e{k), the lemma follows. □ 

Before moving on to the lower bound, let us put together the pieces 
from this section. 

Corollary 18. Let X > 1/2, let n ^ N be sufficiently large, and let 



1 1 / log n X log log n 
= 2 - 2V^ + T^T^- 

Then, in majority bootstrap percolation on Qn, with initial set A of 
density p, 

F{A percolates) — > 1 

as n oo. 

Proof. Let n and p be as given. By Lemma [15] it follows that P(x G 
A^"^^) ^2/3, for sufficiently large n (depending on A). So, by Lemma [T6l 
it follows that P(a; ^ A^^^) ^ e"'^" for some c = c(A) > 0, and thus by 
Lemma [HI it follows that P(x ^ A^^^)) ^ e-"^"" for some c/ = d{X) > 0. 
Finally, the union bound gives 



F{A does not percolate) ^ P ( |J ^ ^^"^) ) 

\x&Qn / 



^ J] P(x ^ ^ 2"e-'^'^' = o(l), 

as required. □ 

5. Proof of the lower bound in Theorem [T] 
Let G N be sufficiently large, and let 



1 1 / log n X log log n 

pin) = - — -\ + — , 

' 2 2 V n V^logn ' 

for some A G M, as in the previous section. We shall couple the 
bootstrap process with a modified process in which, if A ^ —2, then 
^(4) ^ ^(3) _^ ]^jg]^ probabihty. 

We shall start slowly, and build up to the full result. First, we con- 
sider just a two step process, and show that ifp ^ 1/2 — e then the 
original process does not percolate. Let us refer to the original (ma- 
jority) process as Boot, and let t = t{n) ^ be any non-negative 
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function. We define the process Bootl(t) as follows. 

• The elements of A^^^ are chosen independently at random, each 
with probability p. 

• X e v4(i) if X e or |r(x) n v4(°)| ^ n/2 - t. 

• If i ^ 1, then x G A^'+^^ if x G A« or |r(x) n A^] ^ n/2. 

Note that Boot = Boot(O), and that the process Bootl(t) dominates 
the process Boot, in the sense that given the same initial set A^^\ then 
for each k eN, the set A^''^ given by Bootl(t) contains that given by 
Boot. 

The following simple result, together with Corollary [181 implies that 
the critical probability for percolation in the hypercube is 1/2 + o(l). 

Proposition 19. Let e > 0, and suppose p = 1/2 — e and t = en/A. 
Then in Bootl(t), A^"^^ = A^^^ with high probability. 

Proof. Let x G Qn, and suppose x G A^"^^ \ A^^\ Then there must exist 
a set T C T(x), with |T| = t, and T C A^^^ \ A^^\ This is because 
X G v4(2) \ y4(^) implies |r(a;) n A^^^l ^ n/2, and x ^ A^^) implies 
|r(x) n < n/2 - t. We shall show that P(T exists) < e"^"" for 
some c > 0. 

Indeed, recall that S{x, k) = {y E Qn ■ d{x, y) = k} for each k E N, 
and consider the set T(T) fl S{x,2). It has (*) elements with two 
neighbours in T, and t{n—t) elements with one neighbour in T. Denote 
these two sets B and C respectively. Now, we claim that since T C 
\/l(o), we have 

IcnA^l ^ t(--t) - 2(1] ^ - - 2t\ (6) 



This follows by counting edges. Indeed, note that each member of T 
has at least n/2 — t neighbours in ^4^°^ and we assumed that x ^ A/-^'^ . 
Each vertex of C has only one neighbour in T, so even if every member 
of B is in we still get the bound ([6]). But 

E(|CnAW|) = pt{n-t) = - ent - ^t^ 
nt 

^ — — ent, 
2 

and \C n A^^^\ ~ Bin(t(n — t),p), so, since t = en/A, 

.mh nt A f 2ient~2t^f\ ( e^n^' 
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by Lemma m But we have only at most 2" choices for the set T, so 



< e 



for some c > 0, and so, since |^((5n,)| = 2", 

p(^(2) y^(i) _^ 0) ^ 2"P(x G \ A«) = o(l), 
as required. □ 

The following corollary is immediate from the proposition and Corol- 
lary [13 

Corollary 20. p,(g„,n/2) = ^ + o(l). 

Proof. By Corollary [18] we have pdQn, n/2) ^ 1/2 + o(l), so let e > 0, 
t = en/4 and p = 1/2 — e, and consider the Bootl(t) process on Qn- 
First note that A^^'' ^ ViQnj with high probability, since, by LemmalU 
P(|A(^) \ ^ 2"/100) = o(l). Thus, by Proposition [I9l the process 
Bootl(t) does not percolate with high probability. 

Now, coupling Boot with Bootl(t) in the obvious way, we see that 
also Boot does not percolate whp, so Pc{Qnin/2) ^ 1/2 — e. □ 

Remark 2. In fact, letting e = an ' for some a G M with > 32, 
the same proof in fact gives 

1 a 
Pc{Qn,n/2) > - 



2 nV3' 

since F{x G A^^) \ A^^^) ^ 2"e-"'"/^ and so P(A(2) \ ^(i) 7^ 0) = o(l). 

In order to improve this bound, we have to allow the process to go 
a little further. We call the following process Boot3(t). 

• The elements of A^^^ are chosen independently at random. 

• X G A^^^ if X G A^^'' or |r(x) fl A^l ^ n/2 - St. 

• X G A^"^^ if X G A^^'^ or |r(x) fl AW| ^ n/2 - 2t. 

• X G if X G or |r(x) n ^ n/2 - t. 

• If i ^ 3, then x G if x G A^'^ or |r(x) n AW| ^ n/2. 

Note that, since we are trying to distinguish between values of p 

(log log n \ 
■ , we should take t no larger than n times 
y/n log n J 



n 

this. In fact, we shall show that if t = , p is as above and 

logn 
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A ^ —2, then in the Boot3(t) process we have A^'^^ = A^^^ ^ with 
high probabihty. We begin by showing that A^^^ ^ Qn (in fact we only 
need the shghtly weaker condition, that A < 1/4). 



T n^ T,\^^l^ ^ Aloglogn 

Lemma 21. Let \ < \ A, Let o = - \ , ana suppose 

2 V n vnlogn 

1 I n 

that p = S and t = ./ . Then in Boot3(t), A^^^ ^ Qn whp. 

2 Y logn 

Proof. Let x G Qn- We shall show that P(x ^ A^^^) > e"*^" for some 
small c > 0, and that there is a set {xi, . . . ,xe}, with i ^ 2"/n^, 
for which the statements Xi G Qn are independent. The proof that 
F{x E A^^'^) is not too big is similar to that of Lemma [T5l 

First we show that F{y e A^'^^ \ y ^ = o(l), for each y e 

r{x). In fact, we shall need a slightly stronger result: that this still 
holds, given any set r(a;) fl A^^\ Clearly the events y E A^"^^ and 
are positively correlated, so let y G r(x) \ A^^\ and as- 
sume that r(x) n = r(x) \ {y}. Now, let R = T{y) n and 
^ = r(?/)nA«\A(o). WewanttoshowthatP(|i?| + |5| ^ n/2-2t) 
as n oo. This follows easily from the following claim. 

Claim : P (|5| > e^°v^ (logn)^^) = o(l). 

Proof of claim. We use the second moment method (Lemma [T^ . First 
we must bound the expected size of S. By Lemma H] we have, for each 
vertex z G T{y), 

p(|r(z)nA(°)| ^ ^-3t) = p(|r(z)nv4(o)\r(x)| ^ ^-3t-i 



exp 



2 

2{6n - Atf 



n 



1 1 /logn A log logn . , , 

where o = p = -\ , smce z has only one other 

2 2\n y/n\ogn 

neighbour in r(x), and t ^ 1. Observe that 

, 32^2 logn , e , f {loglognf 

2d n — loot H = — 2A log logn — 8 + O ' 



n 2 \ logn 

and so 

E{\S\) = nF{z ^A^^^)F{\T{z)nA^^^\^n/2-3t) 



— + 2A log logn + 9j = e^y/n{\og 



n] 
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since n is sufficiently large, so we can assume the term O ^ '°s ") 



log n 

is at most 1. It is similarly straightforward (using Lemma [5]) to show 
that Ed^l) — > cxD as 77, — »• cxD. 

Now, we need to show that the variance is not too big. Consider two 
distinct vertices u,v E T{y) \ {x}, and note that \T{u) fl r(t>)| = 2, and 
that F{u e S\v e S) ^r{u e S). Let r{u) n r{v) = {w,y}, and also 
let T{u) n r(x) = {a,y}. Then, since we assumed y ^ A^'^\ 

¥{ueS\veS) ^ ¥{ue S\w e A^'^^) = {l-p)F{S'{n) n/2-3t), 

where S'{n) = \ (r(n) n U{a,w}\{y}\ ~ 2 + Bin(r2 - 3,p). But 

P(u e S) = {l-p) ¥{S{n) ^ n/2 - 3t), 

where S{n) = \ n U {a} \ {y}\ ~ 1 + Bin(n - 2,p). Thus, 

by Lemma [TU| 

F{ueS\veS) = {l + o{l))¥{ue S), 

and so, since EdS*!) — > oo as n — > oo, we have VardSI) = o(E(|S'|)^). 

By Lemma [HI it follows that l^l ^ 2E(|S'|) with high probability. 
Thus 

P(|^| > e^°v^(logn)2^) ^ P(|5| > 2E(|5|)) = o(l), 

as claimed. □ 

Now, let m = e^^^/n{\ogn)'^^, let R denote the event that \R\ ^ 
n/2 — 6n/2, and let S denote the event that 15*1 ^ 6n/2 — 2t ^ 6n/3. 
Note that 6n ^ m since A < 1/4. Thus P(i?) = o(l), by Lemma HI 
and F{S) = o(l), by the claim, and hence 

P(?/ G |y ^ ^ P(^) + P(^) = o(l). 

To complete the proof of the lemma, we need to use the simple fact 
that for positively correlated events Ei, . . . , Ek, 

(k \ k 
f]EA ^ l[F{E,). 
i=l / 1=1 

Given y G Qn, let T{y) denote the event that y ^ A^'^\ Noting that 
the events {T{y)}yf^Y{x)\AW are positively correlated, we have, for any 
set T{x) n A^^\ any (small) e > 0, and sufficiently large n, 

P(r(x) n \ = 0) = P I Pi T{y)\ ^ (1-e)", 

\yer(x)\A(o) / 
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and therefore 

P(a;^A(3)) ^ P(x^ A(o))P(^|r(x)nA(°)| ^ ^-3t)p I p| T{y) 

\yeT{x)\A(0) 

^ ^ X (1 - £) X (1 - £)" ^ exp (-2£n) . 

Finally, if d{u, v) ^ 7 then the events u G A^^^ and z; G A*^^) are 
independent, so, by Lemma [H there exists a set K of size at least 
2"'/n^ for which the events {x G A^^-'j^gi^- are all independent. Thus 

P(x G ^(3) for all x G iT) ^ (1 - e-^"")!^! = o(l) 

if £ is chosen to be sufficiently small. This completes the proof. □ 

Next, we shall show that the Boot3(t) process stops after at most 
three steps if A ^ —2. We shall need the following lemma about count- 
ing 3-uniform hypergraphs. 

Given a 3-uniform hypergraph H, and i,j G [n] with i < j, we shall 
write dniiyj) for the degree of the pair {i,j} in H, i.e., 

dH{i,j) = \{k:zjkeEiH)}\. 

We write if ^ G if if is a (not necessarily induced) sub-hypergraph of 
G, and define \\H\\ = J2^<j {''"'i''^). 

Lemma 22. Let G be a labelled S-uniform hypergraph with n vertices. 
Further, let t'^ ^ n t^ , s ^ t^ , m e N, and 

S{G, m,s) = {H ^G: e{H) = s and \\H\\ ^ m}. 

Then, for sufficiently large n, 

' ^ ' ' ^' \ s J \s-2m/n + 2t^/n) 

Proof. Let H G ^(G,m,s), so H ^ G, e{H) = s and \\H\\ ^ m. We 
partition the elements K = {{i,j) : i < j} into two sets, which we 
imaginatively title 'big' and 'little'. To be precise, let r = r(t) be a 
function to be determined later, let 

Ki = {{i,j) : dniij) > r}, 

be the big set, and let = \ -^i be the small one. Let li^il = ki 
and \K2\ = ^2, let 

iy = {e G E{H) : {i,j) G Ki for some i,j G e}, 

and \L\ = £. We shall show that if \\H\\ is large, then i must be large, 
and hence that there are only few choices for H. 
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Indeed, first note that J2i<j dnih j) = 3s, since each edge contains 

3s 

exactly three elements of K, and hence ki ^ — . Also, note that 

r 

dH{i,j) ^ i + i ^ j , since i edges intersect Ki, and each pair of 

elements of Ki is contained in at most one edge e G E{H). Thus we 
gain at most (^2^) extra in the sum from those edges of L which contain 
more than one element of Ki. 

Now we reach the crux. Recall that dnihj) ^ n for all i and j, and 
dnihj) ^ T if G K2. Thus, by the convexity of (2) (Observa- 
tion [T2]), and letting r = we have 



\H\\ ^ 



3s 

since A;i ^ — ^ 6t, so kfn -C t^. 
r 

Now we have only to count. To determine H, it suffices to choose 
the ki pairs in Ki, and i edges incident to these pairs, and then 
s — i other edges. Thus, 

/ Gent V / e{G) 



.(o,».)i.E(::)Cr)(fD<«E 



\ s / \s 



since ki ^ 6t, and using the trivial bound £ ^ s. But ( ] is 



decreasing in £, and i ^ by ([7j), so 

n 



s 



s 

I z\ \ni, \ ^ 

^ 2 



J \s -2{m-t^)/n 



■20ntY / e(G) 



s / \s — 2m/n + 2t^/ny 
for sufficiently large n, as claimed. □ 
We are ready to prove the key lemma. 



Lemma 23. Let X ^ —2, let 5 = -\ — o,nd suppose 

2 V n ^n\ogn 

1 I n 

that p = - - 5 andt = J- . Then m Boot3(t), A^*^) = A^^) whp. 

2 y logn 
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Proof. Let n G N be sufficiently large, let x G Qn, and suppose that 
X G A^^^ \ A^^\ Then there exists a set T C r(x) with |T| = t, and 
such that T C A^^^ \ A^^\ This is because x G A^'^^ \ A^^^ implies 
|r(a;) n ^ n/2, and x ^ A^^) implies \T{x) D A^"^^] < n/2 - t. It is 
convenient to think of the vertices of Qn as subsets of [n] , and to assume 
(without loss of generality) that x = 0, and that T = {{i} : i G [t]}. 
We have at most 2" choices for the set T. 

Similarly, each vertex y & T must have at least t + 1 neighbours in 

^(2) \ ^(1). 

bmce X ^ A^^\ these are in S{x,2), and note that each 
vertex of S{x, 2) has at most two neighbours in T. Thus there must 
exist a set T' C (A^^) \ ^(i)) n S{x, 2) n r(T) with 1^2 ^ |T'| ^ Let 
|T'| = t'. Given T, we have at most ^^(^2*) choices for T'. 

Using the same logic one more time, each vertex y must have at 
least t + 1 neighbours in A^^^ \A^^\ and at least t of these are in S'(x, 3). 
Each vertex of S{x, 3) has at most three neighbours in T', so there 
must exist a set ^ C {A^^^ \A^^^)nS{x, 3) nr(r') with t^/Q ^ \S\ ^ t^. 
Let IS"! = s, and, considering S* as a 3-uniform hypergraph on [n], 

let ll^ll = m. If m < t^, we could use the trivial upper bound ("* ) 
on the number of choices for S (given T'). For m ^ however, we 
shall need the following stronger bound, which follows from Lemma [221 

Claim 1 : Given T', s and m, there are at most 



potential sets S with jS"! = s and II^H ^ m. 

Proof of Claim 1. Let G be the 3-uniform hypergraph on [n] with edge 
set E{G) = {e G S{x, 3) fl r(T')}, and similarly consider S' to be a 3- 
uniform hypergraph in the obvious way, i.e., -^(5*) = {e G S{x, 3) fl S}. 
Note that S C e[g), \E{G)\ ^ nt' ^ nt^, ^ n <t: t^ and s ^ so 
the result follows immediately by Lemma [221 □ 

Now, consider the neighbourhood D of S in S{x, 4), and let d be the 
number of edges of Qn between S and D. Next, partition D into four 
parts, Di, D2, -D3 and D4, where each element of Di has i neighbours in 
S, and let \Di\ = di. Furthermore, let Ri = DiCiA^^^ and let \Ri\ = ri. 
We have 




d 



di + 2d2 + M-i + 4(^4 



[n — 3)s, 



and 



r 
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since each vertex in S has at least n/2 — 3t neighbours in A^^\ and at 
most two neighbours in A^'^^ \ S(x,4:). Also, 

E(r) = pd = p{n — 3)s ^ s ^— — 6n 

and Tj ~ Bin((ij, p) for i = 1,2, 3, 4. 

In order to apply Claim 1, we shall need some bound on \\S\\. The 
following claim gives us one. 

Claim 2 : \\S\\ ^ d2 + Sd^ + 66/4. 

Proof of claim. If ijki G -D2 then ijk G S* and iji G S*, say. Thus 
we add one pair to ds{i,j). Similarly if ijki G -D3, we add a pair to 
ds{hj)y ds{i,k) and ds{i,i), say, and if ij/c£ G -D4, we add a pair to 
each of the six degrees. Finally, each pair of 3-sets are both contained 
in at most one 4- set. □ 

Now we apply Lemma El Let m = d2 + 3d^ + Gd^, so \\S\\ ^ m. By 
Claim 1, we have at most 



2 



/'20nty / nt^ \ 

\ s ) \s -2m/n + 2t^/n) 



ways of choosing the elements of S. Thus, recalling that s ^ ^ n, 
so 2"(5ns)^ <^2^, we get 

X F(r^ E(r) + {5n-4t)s^ 
^ / 30072 V ent2 V'^^^ / 2((5n-4t)V\ 

^ L (^^J U^^^T^j ^(4^J' 

di,d2,d3,d4 ^ \^ n ^ n / ^ \ I / 

where -D(4) = rfi + 4(i2 + 9^3 + 16^4, as in Lemma [61 Note the replace- 
ment of 2t^ /n by 3t^/n, which is motivated by the later computation. 

The rest of the proof is just a straightforward calculation. Indeed, 
consider 



= ) [s-2m/n + 3t^/n ) V ^(4) 
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where {n — 3)s = di + 2^2 + 3^3 + id^ and m = d2 + Scia + Gd^, and 
note that 



L)(4) = 6/1 + 4^2 + 94 + 164 

= {n- 3)s + 2d2 + 64 + 124 = (n - 3)s + 2m. (8) 

This trivial observation will allow us to bound M(d) from above for 
all (cii, 4, 4, 4)- 

Indeed, recalhng that 5 = -\ — ^ ^ and t ^ 



2 V n y/n log n y log n ' 

first observe that 



n log — 4An log log n ^ 4(5n — 4t)^ ^ n log — 4An log log n — 32(5tn 

^ n log n — 3Xn log log n 



since n and 1 ^ 32(5t ^ log log n. Thus, if m ^ asn for some 

a ^ 0, then 

2(5n - Atfs'^ 2{5n - Aifs"^ Am{5n - AtYs"^ 



sn + 2m sn sn(sn + 2m) 

s(logra — 3Aloglogn) m(logn — 4A loglogn) 
^ 2 (1 + 2a)n ' ^ ' 



Finally, we may assume that s — 2m/n + 2t^/n ^ 0, as otherwise 

f ^ _ , ) = 0. Thus, applying (Q with a = 0, and using 

s — 2m/n + 2t^/?2 / 

O), we have 



^ l^J l^J .n + 2m 



2m I 



/ A"/" /^'V*'^V 900n5/2(logn)3V2 Y 

Vn3(iogn)4A y V^3y V / 
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Now, recall that — = loerra, and s ^ — . So if m < — , then 

/ 1 \ rn/n 

(gOOe'^logr!)^^*'/*)' s; (logn)"'". 

STl 

However, if m ^ — , then we may apply (j9]) with a = 1/4, so 
/QOOnV l^^'Y'^^^^.. / 2{5n-Atfs^ 



^^^^ ^ m UJ ^^^V -+2^ 



i-lS X W3n /^2X 3*Vn / QQOn^/^ (^l^gn)3X/2 



Cn20 (log n) \ / C(log n)20+10A X «/12 



y V ^2 



for some constant C. (Note that in both calculations we made the 
substitution m = sn/4; we could do this in the first case because 
4A + 3 < 0, and in the second because n^^ ^ t^^.) Since t ^ logn, it 

follows again that M(d) ^ (logn) for sufficiently large n. 
Thus, since di, . . . ,d4 ^ sn, 

n3S\s,T') ^ Yl (logn)"'^' ^ H'(logn)-*'/'', 

and we have at most choices for s, and at most 2^"^^ ("2*) choices for 
X, T and T'. Thus, summing over all of these, we obtain 

P(3X e A(^) \ ^ t222„ (^,3)4 ^ ^^^^^ 

as required. □ 
At last, we are ready to prove Theorem [H 
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Proof of Theorem Ul The upper bound in the theorem is exactly Corol- 

fi 

lary [T8l so let A ^ —2, let n G N be sufficiently large, let t = , 

logn 

let 

1 1 /logn A log log n 

2 2\ n vnlogrz 
and consider the Boot3(t) process on Qn- 

By Lemma ED we have ¥{A^^^ = V{Qn)) = o(l), and by Lemma [2l 
we have P(A(^) ^ A^^)) = o(l). Therefore, F{A percolates) = o(l) in 
the Boot3(t) process. The obvious coupling of Boot with Boot3(t) 
now shows that F{A percolates) = o(l) in the original process also, as 
required. □ 

We conclude the section by briefly discussing ways in which Theo- 
rem [1] could be strengthened, and the limitations of our method. The 
alert reader will no doubt have noticed that the constant A = —2 is 
not sharp; indeed, with a little more care (and no extra ideas) we could 
have proved that A is unlikely to percolate whenever A < —3/4. 

However, our method, as it stands, cannot prove the result for any 
A > — 1/4. To see this, consider an ideal set S, with \S\ = and 
II 5*11 small. Using our method (based on the random variable r), and 
Lemmas H] and [5l it has probability at most about 

exp {-25^nt^) 

of being contained in A^^^ \ A^^K There are about (^"fs ) such sets, and 
so, writing X for the number of suitable sets S C A^^^ \ A^^\ we get 

P(X ^ 1) ^ E{X) ~ ( ^3 ) exp {-2S^nt^) 

fn f logn , AV /, x(2A+i/2)t3 

^ I— exp I ^ |-2Aloglognl I ^ (lognj 

which is small only if A < —1/4. 

There are two obvious places in which we could potentially be leaking 
probability. The ffist is in our estimation (using r) of the probability 
that S is contained in A'^^'^ \A'^^'>] the second is in the inequality P(X ^ 
1) ^ E(X). A heuristic calculation suggests that Var(X) is not too 
big, and so we suspect that the ffist of these is in fact the problem. 

Finally, we point out that two simpler changes, which one might 
think would improve the result, in fact do not help. Firstly, we could 
take the process one (or more) step(s) further (i.e., consider a Boot4(t) 
process), but we would just run into the same calculation, with 
replaced by t^. Alternatively, we could increase t; however, we cannot 
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do SO significantly, since we need the inequality 6t -C log log n in order 
to prove inequality 

6. Percolation on cZ-regular graphs 

In this section we shall prove Theorem [21 which uses the ideas of 
the previous two sections, and deduce Corollary [31 Throughout this 
section, let G be a graph as in the statement of Theorem [21 so G is a 
(i-regular graph on vertices satisfying 

\Six,t)nTiy)\ ^ m 

for every x,y E V{G) with y G V{G) \ B{x,i — 1), and every i G [k], 
where d,N,k E N, and /i, . . . , : N — N are functions satisfying 

1 ^ Md) < 

fclogct 

for some small constant rj > 0, to he chosen later. Since percolation on 
a 1-regular graph is not very interesting, let us assume that d ^ 2. In 
fact, when rj is small, the inequality above implies that d ^ l/t]. 

We begin with the upper bound; the idea is that, by Chernoff's 
inequality (Lemma [1]), all but exponentially few of the vertices are in 
A^-^\ and therefore that F{x ^ A^''^) -C First we use Lemma [S] to 
prove a version of Lemma [9] applicable to the graph G. 

Lemma 24. Let d,k E N and G be as described above, and let x G 
V{G). Then there exists a partition 

S{x, k) = BiU ...UBm 

of S{x,k) into m ^ d(^fk-i{d) + fk{d)) + 1 sets, such that ify,z G Bj 
for some j , then d{y, z) ^ 3. 

Proof. Let y G S{x, k), and consider the set 

Y := B{y,2)n S{x,k) = {z e S{x,k) : d{y, z) ^ 2}. 

We claim that |y| ^ d(^fk-i{d) + fk{d)) + 1. Indeed, ?/ has (i neighbours, 
of which none are in B{x, k — 2), at most fk-i{d) are in 5'(x, k — 1), 
and at most fk{d) are in Y . But if 2; ^ k — 1) then it has at most 
fk{,d) neighbours in S{x,k), by assumption. Thus 

|F| ^ 1 + fk{d) + dfk-iid) + {d-fk-i{d))fk{d) 

^ d{fk-i{d) + fk{d)) + 1 

as claimed. Now, by Lemma [H] applied to the graph G[S{x,k)], it 
follows that the claimed partition exists. □ 

We are ready to prove the main step in the upper bound. 
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Lemma 25. Let d,kEN,0<r]^l/6 and G he as described above. 
Let X G V{G) and c > 0. Suppose that 21] ^ c, and that p is chosen 
such that, in the majority bootstrap process on G, 

Then, 



3'=+i(/fc_i(d) + /,(d))nti/.(c?) 

Proof. The proof is very similar to that of Lemma [TTl and so we shall 
give only a sketch. Let x G V{G), and let m = d[fk-i{d) + fk{d)) + 1. 
By Lemma [2ll there exist sets Bi, . . . , Bm such that [J Bj = S{x, k), 
and for each j G [m], the events {y G A^^^}y^Bj are independent. 

We proceed as in the proof of Lemma [T71 suppose x ^ then 
for each t G [0,k] there exists a set T(t) C S{x,t) such that T(t) fl 

I^WI ^ n ■ 

3*n::;/.(rf) 

Indeed, let T(0) = {x}, and note that since x i A^^^^\ T(0) satisfies 
the conditions. Now, suppose we have found T(t) as required. Then, 
each y G T(t) has at most djl neighbours in Six., t + 1) fl J^^~^^ (since 
y ^ A^*^"*"-^"*^), and at most ft-\{d) + ft{d) ^ dj'o neighbours in B{x,t) 
(by the properties of G, and since rj ^ 1/6), and thus it has at least d/3 
neighbours in S{x, t + 1) \ A^''^'^\ Moreover, each element of S{x, t + 1) 
has at most ft{d) neighbours in S{x,t). Thus, by counting edges, there 
must exist a set T{t + 1) C S{x, t + l)\ A^^^'^^ such that \T{t + 1)| ^ 
d\T(t)\/3ft{d), as required. 

Now, define 

d'' 

and note that |T(fc)| ^ mN{k) and that therefore, by the pigeonhole 
principle, there must exist j G [m] such that \T{k)r]Bj\ ^ \N{k)~\ ^ 1. 

Now, recall that F(y ^ A^^'^) < e~'^^ for each y E Bj, that the events 
{y G A^^^}y^Bj are independent, and that T{k) fl A^^^ = 0. Observe 
also that, since klogd ^ rjd, 

Thus, by Lemma [7] we have 



5jf ^ d^V"'^ = exp (-cd + 2r/ci) ^ I. 



F{\nk)r^B^^N{k)) ^ 2{e^^y'^' = 2exp(-^^) 
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and so 

P(x ^ ^(2'^+^)) ^ P(3j with \T{k) nBj\^ N{k)) ^ 2m exp 



^ (f exp 



as required. □ 



The upper bound in Theorem [2] will follow easily from Lemma 
(see 'Proof of Theorem [2l, below), and so we now turn to the lower 
bound. The method is based on that of Section [Sj we begin by defining 
the natural generalization of the Bootl(t) and Boot3(t) processes. 
Given k,t eN, we call the following process Bootk(t). 

• The elements of A^^^ are chosen independently at random, each 
with probability p. 

• IfO^m^A; — 1, then 

X e if X G A^'") or |r(x) n ^ n/2 - {k - m)t. 

• Um^k, then x G A^^+i) if x G A^*") or |r(a;) n ^ n/2. 

1 ed 

We shall show that, if p = e and t = — , then in the Bootk(t) 

2 Sru 

process we have ^(^=+1) = A(^) ^ V{G) with high probability. The 
following lemma is the key step. 

Lemma 26. Let d,k E N, rj > and G be as described above. Let 
1 ed 

e>0, p = -— e,t= — and m E [k], and suppose that 12// ^ e^. 
Then, in the Bootk(t) process, for every x G V{G), 

Proof. Let x G V{G), and suppose that x G \ A^™). Then we 

claim that, for each £ G [0,m] there exists a set T(£) C ^(x,^) such 
that T(£) C \ and 

^ |r(£)| ^ 

Indeed, let T(0) = {x}, let t G [0,m — 1], and assume that we have 
found T[t) as required. Now, let y G T(£), and note that, as in the 
proofs of Proposition [T9] and Lemma [231 since Y G A^"^~^^^^ \ A^^~^\ 
it follows that y has at least t neig hbours in A^'""^) \ A^'^-^-^K 



P(x G ^("+1) \ ^ exp 
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Recall that y G S{x,^)^ so y has at most 

neighbours in B{x,i), since 121] ^ elogd. Thus y has at least t/2 
neighbours in S{x, £+l)nA('"-^)\A("-^-^), and each vertex of £+1) 
has at most fi{d) neighbours in T(£), so, by counting edges, 



as required. 

Choose a set T(m) as described, let s = \T{m)\, note that T{m) C 
S{x,m) n y4*^^^ \ A'^''^ and consider the neighbourhood of T{m) in 
S'(x, m + 1). As in the proof of Lemma [23| we shall consider the number 
r of edges between T(m) and W whose end-point in W lies in A'^^\ and 
show that it is far from the expected number. 

First note that each vertex in T{m) has at least - — mt neighbours 

in A^^\ and at most fm-iid) + fm{d) neighbours in B(x,m), so 

fd p / ,\\ sd esd 
r ^ s I - -mt - - 1 ^ — 

, „ , rid £d 
smce mt ^ — and f,- a ^ ^ — . Next, observe that 

sd 

E{r) ^ psd = — — esd. 

Now, each vertex of W has at most fm{d) neighbours in T{m); let di 
be the number of vertices with exactly i neighbours, and recall (from 
Lemma [6]) that D{k) = ^i^i'i'^di for each A; G N. By Lemma [6l we 
have 



2D{fUd)) 



Observe that D{fm{d)) ^ fm{d)^tf' idi ^ fm{d)sd, note that 

.t^ t £ log d 

^ ^ , and recall that 48?7^ ^ e^logd. It then 



2/m((i) Qr] 
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£^ sd s^djXo^d 

follows that — „ , ^ ^ Arid, and thus 

2/„(c/) " 127] " ' ' 

P fr ^ E(r) + ^ exp f log(rf^+^)^ exp ^ 



^ exp I 2r]d — ^ exp 



since s ^ d^ . 

We have shown that the probability of a particular s-set being con- 
tained in A^^^\A^^'^ is small; now we simply sum over all possible s-sets. 

There are at most I ) choices for the set T(m), and so 



^ e^sd e'^kslogd ^ i , i i f d"'\ 

But , , , ^ : ^ Sms log ct, and log ^ ms log a, so 



/ e'^sd \ ( e^sd 



Thus, using our lower bound on s, and recalling that t = — , we get 

e^d \ i r 



^ exp 



^r?i+2^r?i+l 



as required. □ 

We are ready to prove Theorem[2l The crucial property of G = G{d), 
which we have as yet not used, will be that = |l^(G')| satisfies 



N ^ exp 



for every c? G N. 



d'' 



[uj{d)kY{h^M + ud)) nti m 



Proof of Theorem\^ We shall deduce Theorem [2] from Lemmas [25] and 
In order to apply these lemmas, we need rj to be sufficiently small. 
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SO first let e > 0, recall that fi{d) ^ f{d) for each i e [/c], and let d be 
sufficiently large so that 

We begin with the upper bound. Let p = 1/2 + e, and choose the 
elements of A^^^ C V{G) independently at random with probability p. 
Let X G V{G), and recall that = we shall show that 

p (^(fc+i) ^ v{G)) ^ NF{x^ = o(l) 

as d — 5> oo. 

Indeed, x has d neighbours, and so, by Lemma HI 

P (x ^ A^^^) ^ P(Bin {d,p) < d/2) ^ exp ( — e'^d^ . 
Thus, by Lemma [25| applied with c = e^, 

e^d" 



exp 



3'-+'(A-iW+AM) nr., /.M 



as c? — > oo, as required. 

Now we turn to the lower bound. Let p = 1/2 — e, and again 
choose the elements of A^^^ C V{G) independently at random with 

ed 

probability p. Let t = —, and recall the Bootk(t) process, defined 

Orh 

above. We shall show that, in this 'more generous' process, we have 
^(fc+i) ^ ^(fc) _^ V{G) with high probability. The result then follows 
by a straightforward coupling of the two processes. 

The first part, that yl^^^^^ = A'^'^^ in Bootk(t), follows immediately 
from Lemma [26l since 



p(^(fc+i) ^ ^(fc)) ^ E{\A^^+^^ \ A^l) = NF{xe A^^+^^ \ A^''^) 



^ exp 



d^ ( 1 

as c? ^ oo. 

For the second part, that A^^'^ ^ V{G)^ we again use Lemma 
First note that P (x G A^^^ \ A^^^) ^ exp ( - e^d) , by Lemma H Now 
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recall that, by Lemma [26| 

P(xe ^ expf- ^ ' 

^ exp{-d{logdD ^ exp{-e^d) 
for every m G [k]. Thus, by Markov's inequality, 

k-l 



m=0 



^ 4kexp{-e^d) = o(l) (10) 
as d ^ oo. Finally, again by Lemma HI 

P ( ^ exp (--] = o(l). (11) 



4 ^ 

Combining ([10]) and gives P(A('=) = 'l^(G')) = o(l), as required. □ 

Before proving Corollary [31 let us note that, in one direction at 
least. Theorem [2| cannot be improved substantially. Suppose k,f{d) = 
0(1), so the bound on N becomes N = exp (o((i'^)). The following 
example shows that the theorem is false if this bound is replaced by 
N = exp{0{d^+^)). 

Example 1 . Given two graphs Hi and H2 , we write Hi U H2 to mean 
the graph with vertex set V{Hi) U V{H2) and edge set E{Hi) U E{H2), 
i.e., the graph obtained by putting the graphs side by side. 

Let d,k eN (with d large) and C g M, let M = exp {Cd^+^) , let 
N = d^^^M, and let if be a (i-regular graph on d^^^ vertices which 
satisfies the conditions of the theorem. Then the graph 

G = /Ji U . . . U Hm, 

where the graphs Hi are disjoint copies of if, is a rf-regular graph on 
vertices which satisfies the conditions of the theorem, but 

Pc{G,d/2) ^ 1-e, 

where e = e{C) as C 00. This follows simply because if 
p < 1 —e, then with high probability at least one of the sets V{Hi) fl A 
is empty, and so no vertex of Hi is ever infected. 

To show that such a graph H exists, we have only to consider a 
random (i-regular graph on L = d*^"'"^ vertices. With high probability, 
such a graph satisfies the conditions of Theorem [21 with f{d) = k + 3. 
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To see this, let H be such a graph, and note that S{x, k) ^ d!' for each 
X G V{H), so (heuristically) we have 

kU)\T) ^ (V) 

for each y G V{H) \ B{x, k — 1). Thus 



¥{3x,y:\S{x,k)nT{y)\^k + A) ^ 



0(1) 



L 

as required. It is straightforward to make this rough argument rigorous. 

Finally, let us deduce Corollary [3] from Theorem [2l Recall that 
[nY denotes the d-dimensional torus, i.e., the graph with vertex set 

{1, . . . ,n}'^ and edge set {xy : \xi — yi (mod n) \ = 1}. 

i 

Proof of Corollaryl^ Let n = n{t) and d = d{t) be functions satisfying 
the given inequalities, and let G = [n]'^, so G is a 2(i-regular graph 
on N = n'^ vertices. Let u be any function satisfying 1 uj{d) <^ 
log log d. We claim that G satisfies the conditions of Theorem [2] with 

uj(d)k = X and f(d) = k + 1. 

V log a 

Indeed, let x G V{G), and observe that for each m ^ k, 

d 

S{x,m) = {yeV{G) : ^\\xi- yi\\z^ = m}, 

i=l 

where \\xi — yi\\z„ denotes the distance between Xi and yi in Z„, the 
integers modulo n. Thus, given y ^ B{x, m — 1), we have 

\S{x,Tn) nT{y)\ = \{i : Xi ^ yi}\ < m + 1 

if ?/ G S{x, m + 1), and S{x, m) fl Tiy) = otherwise. 
All that remains is to observe that, for some C G M, 

logA^ = dlogn ^ 2^^^ ^ (logrf)'' = ^ 
and hence that 

d^ 



{u{d)ky 



N ^ exp 



{uj{d)kY {f,^^{d) + f,{d)) UtiMd). 
as required. □ 
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7. Further questions and conjectures 

In this section we shall briefly discuss various ways in which the work 
in this paper could be extended. We begin by conjecturing that the 
upper bound in Theorem [1] is sharp. 

Conjecture 1. 



. . 1 1 /logn loglogn ( loglogn 

Pc[Qn,n/2) = - - 7J\/— — + o / 1 



2 2\ n 2y/n logn \2y/n logn 
as n oo. 



Next, recall once again the result of Cerf and Manzo [H], that 

p,[[n]\d) = 0(1) 
when d ^ log^ n, and Corollary [3l which states that 

Pc{[n]\d) = ^ + 0(1) 

when d ^ £:(log log n)^ log log logn. The obvious question poses itself: 
What happens in between? 



Problem 1. Determine lim Pc {[nY, d) for every pair of functions n = 

t—>oo 

n{t) and d = d{t) for which the limit exists. 

In particular, determine the nature of the phase transition between 
those pairs of functions for which the limit is zero, and those for which 
it is non-zero. 

Another entirely natural question asks what happens to the critical 
probability if one changes the threshold function r = r{d)7 The proof 
of Theorem [2] extends easily to the case r = ad-\-o{d) for some constant 
< a < 1, and implies that 

Pc(M^r) = a + o(l) 

for the same functions n{t) and d{t) as in Corollary [31 However, the 
following problem is likely to be more challenging. 

Problem 2. Determine Pc{[n]''',r^ for all functions 2 ^ r{d) ^ d. In 
particular, characterize the pairs (n{d),r{d)^ of functions for which 

^Pc{[nr,r) -> 1 

as d oo. 
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We remark that in [5j, Balogh and BoUobas proved that Pc([2]'^, 2) is 

very far from -; in fact it is (up to a constant factor) equal to — — — . 

For even sharper results on a wider class of graphs, see [6j|. 

Finally, there are many other c?-regular graphs which Theorem [2] 
fails to cover; for example, those with more than 2^ vertices. Our 
final question asks, rather vaguely, for a version of the theorem which 
applies to such graphs. 

Question 1. Does there exist a set of 'local' conditions which allow 
one to determine (or hound) the critical probability for an arbitrary 
d-regular graph on N ^ 2"^ vertices? 

Unfortunately, it appears out of reach to prove anything (in general) 
for fixed d and N ^ oo. 

8. Appendix: proofs of the tools in Section [3] 

In this appendix we shall prove the simple tools used earlier. We 
begin by recalling Stirling's formula, 

. — fn\^ , ^ — (n 
y-nn y—j ^ n\ ^ 2\/Txn y — 

and by making the following basic observations. 
Observation 27. 

(a) Letx^l/A. T/ien e"^-^'" ^ 1-x ^ e"^. 



ib) Let0^5^1/8. Then e-""'-^'" ^ ^ ^ e'^^+^i^ 

(c) Let m,n satisfy 8m ^ n and Am? ^ n^. Then 

n \ 2"-i / 2m2 

^ exp 1 

n/2 + mJ yi^n \ n 



[d] Let X ~ Bin(n,p), where n G N and p = p{n) ^ - — S, where 
6 = 6{n) — > as n ^ oo. Let m = m{n) G [n/2]. Then 
P(X = m)= o(P(X ^ m)) 

as n oo. 

Proof. Part (a) is straightforward. For (6), note that 

1-26 ^ ^—^ ^ 1-25 + 252 
l + S 
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and apply part (a). For (c), note that by Stirling's Formula (applied 
to n\ and n/2\) and part (a), we have 

n ^ > / ^ ^ fn-2m\"' ^ 2"-^ / 2m\™ 
n/2 + mJ ^ \nl2) \ n J ^ a/v™ \ n J 

2"-^ / 2m2 4m3 
> exp ( — 



Tcn \ n 



as claimed. For [d), let n G N be sufficiently large, and observe that 
¥{X = k) = (^)p'^(l — for any k G [n]. Thus, for any integer 

^ t ^ i/n, we have 



P(X = m) V m + t ) \l-pj \n/2 + tJ \l + 26 

2t 

and so, applying part (b) twice, for — and for 26, 

n 

P(X = m + t) f At^ 
^ ^ ^ exp 55t - 1 



P(X = m) \ n 

since <C and <C 6t. 

Finally we sum from t = to 1/5, to obtain 

P(X ^ m) ^ P(X = m) ^ exp I 5(5t - 1 ) 



g-10 

> — ^ ^ CX3 



as n — >• 00. □ 

We are now ready to prove the results in Section [31 We begin with 
our reverse Chernoff bound. Lemma [H 

Proof of Lemma\^ First let C = 0. We have 

k=n/2 ^ ^ 



n—k 



Note that p{l — p) = ^ " '^^5 observe that since p ^ 1/2, the 

function g{k) = (^)p'^(l — is decreasing on [n/2,n]. Thus, for 

any m ^ 0, we have 



n/2 + mJ \A J \1 — p 
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Let m = ,1- . Now, by Observation I^TT c) we have 

log n 

/9 _L ^ ^ exp 1 , 

n/2 + mJ y/TTn \ n J 

by Observation [271(a) . and recalhng that 86^n ^ 1, 
/I \ ''''^ 

^ 2"" exp (-25^^ - 85^n) ^ 2"" exp (-25^^ - l) , 
and by Observation 1271(6) . 

J = ( ]^ _|_ ) ^ ^"^P (—45m — 165^m) ^ exp (—45m — 1^ 

Therefore, 

P I S(n) ^ - ) ^ — = exp 26^n - 45m - 3 

V 2/ 2y/7m \ n J 



> , exp ( -25^71 - 45. ^ 



2e^^/¥\ogn \ Y logn^ 

as required, since 2m^ ^ n and 2e^y^ < e^. For general C the proof 
is the same, since C ^ 171/2 for large n, so 

and 4e^y? < □ 

Next we prove Lemma El which generalizes Lemma HI to a weighted 
binomial distribution. 

Proof of Lemma\^ We prove the lemma by induction on k. For /c = 1, 
it is exactly Lemma l4l(a), so let k ^ 2, and assume it is true for k — 1. 
Recall that = Y^-i + kX^. Thus, by the induction hypothesis and 
Lemma HI 

p(n ^ E(n) + ^ p(n_i ^ E(n_i) + 1) + p (^x, ^ E{Xk) + 

+ J]p(n-i ^ E(n,_i) +m) P (Xk ^ E{Xk) + 

m=l ^ ' 



2e \ ( 2t^ 

^ exp —7777 TT- + exp 



+ V(2t)-^expr-^!^^^ 
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. 1 1 1 . , . tD{k-l) 

Now, simple calculus gives the maximum at m = — . 

D[k — 1) + K ttfc 

Thus 

, (t-m)2 ^ t^edkD{k-l)+t^{edkf _ 



D{k-1) k^dk k^dk{D{k - I) + k^dk) D{ky 

since D{k — 1) + k'^dk = D{k), and so 

p(r.JE(n)+<) « 2exp(-|l) +(<-l)(a)"exp(-|l) 



2f2 

^ (2t)^-iexp 



and the induction step is complete. □ 

The remaining lemmas are even more straightforward. 

Proof of Lemma^^ Recall that pn^ ^ 1, and that S{n) ~ Bin(r;,,p). 
Thus 

r(S(n)=m) < ( ^ M™ ^ 

\m J 

and 

P(5(n) =m + l) 1 
P(^(n) = m) " ^ 2 

for every m E N. Therefore F(S{n) ^ m) ^ 2^™"/^, and the second 
part follows immediately. □ 

Proof of Lemma [3 We apply a straightforward greedy algorithm. Tak- 
ing the vertices one by one, we claim that there is some set Bj, such 
that all vertices already in Bj are distance at least k + 1 from the vertex 
in question. Indeed, this follows immediately from the condition 

\B{x, k)\{x}\ = \{ye V{G) : d{x, y) ^ k}\ - 1 ^ m - 1. 

The greedy algorithm thus gives the required partition of V{G). □ 

Lemma [9] is an immediate consequence of Lemma [H 

Proof of Lemma Let us again consider the vertices of Qn as subsets 
of [n], and let x = 0. Given a vertex u G S{x, k), observe that the set 

{v e S{x,k) : d{u,v) ^2k-l} 

is exactly the set of A;-subsets of [n] which intersect v. There are at 
most k(^^"^^ such sets, and so the result follows by Lemma [HI □ 

Finally, we prove the two easy lemmas. 



42 



JOZSEF BALOGH, BELA BOLLOBAS, AND ROBERT MORRIS 



Proof of LemmalTR Let S"' = S" — 1 ~ Bin(n — l,p), and note that we 
may choose a couphng so that S" ^ S ^ S' . Now 

¥{S'{n)^m) = P(S"(n)^m-l) = (l + o(l))P ^ m) , 

since P(5"(n) = m) = o{F{S"{n) ^ m) ) by Observation ETK^). Thus 

P(S(n)^m) ^ ¥{S'{n)^m) ^ (l + o(l))P (S(n) ^ m) . 



□ 



Proof of Lemma [771 We have 

P A = 1 Lb ?7. ^ m = ^ 7 — — r 

^ I V ; / ; P(^(n) ^ m) 

F{S{n) ^m\X = l)P(X = l) 
F{S{n) ^ m) ' 

and F{S{n) ^ m | X = l) = (1 + o{l))F{S{n) ^ m) by Lemma OTOl so 
the lemma follows. □ 
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